A Query Language and User Interface for XML Information Retrieval
Abstract
As XML is about to become the standard format for structured documents, there is an increasing need for appropriate information retrieval (IR) methods. Since classical IR methods were developed for unstructured documents only, the logical markup of XML documents poses new challenges. Since XML supports logical markup of texts both at the macro level (structuring markup for chapter, section, paragraph and so on) and the micro level (e.g., MathML for mathematical formulas, CML for chemical formulas), retrieval methods dealing with both kinds of markup should be developed. At the macro level, fulltext retrieval should allow for selection of appropriate parts of a document in response to a query, such as by returning a section or a paragraph instead of the complete document. At the micro level, specific similarity operators for different types of text or data should be provided (such as similarity of chemical structures, phonetic similarity for person names). Although a large number of query languages for XML have been proposed in recent years, none of them fully addresses the IR issues related to XML; especially, the core XQuery proposal of the W3C working group [4] offers no support for IR-oriented querying of XML sources; the discussion about extensions for text retrieval has started only recently (see the requirements document by Buxton and Rys [5] and the use cases by Amer-Yahia and Case [2]). There are only a few approaches that provide partial solutions to the IR problem, namely by taking into account the intrinsic imprecision and vagueness of IR; however, none of them are based on a consistent model of uncertainty (see section 5). In this paper, we present the query language XIRQL which combines the major concepts of XML querying with those from IR. XIRQL is based on
Similar resources
Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica
Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...
XML Retrieval with a Natural Language Interface
Effective information retrieval in XML documents requires the user to have good knowledge of document structure and of some formal query language. XML query languages like XPath and XQuery are too complex to be considered for use by end users. We present an approach to XML query processing that supports the specification of both textual and structural constraints in natural language. We impleme...
XOR - XML Oriented Retrieval Language
The wide acceptance and rapidly growing use of XML as a standard storage and retrieval data format blurs the historical divide that exists between Information Retrieval and Database Retrieval. On the structured database retrieval side it is now possible to support highly structured access to documents using XML specific tools such as XPath, XQuery, XQL and more. On the information retrieval sid...
A User Interface for XML Document Retrieval
XML document retrieval requires new ideas for user interface design. The query language provides primitives for dealing with the tree structure, which needs to be reflected in an interface for query formulation. Further, the XML structure is also reflected in the retrieval results, where items may contain each other. In this paper, we present a user interface for formulating queries in an XPath...
Comparing XML-IR Query Formation Interfaces
XML information retrieval (XML-IR) systems differ from traditional information retrieval systems by using structure of XML documents to retrieve more specific units of information than the documents themselves. Users interact with XML-IR systems via structured queries that express their content and structural requirements. Historically, it has been common belief within the XML-IR community that...
QEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches
A major problem in information retrieval is the difficulty of defining the user's information needs; on the other hand, when a user submits a query, there is a vast amount of information to retrieve. Different methods, therefore, have been suggested for query expansion, which is concerned with reconfiguring the query to increase efficiency and improve the criterion accuracy in the information...
Publication date: 2003